Abstract

PDFgetX3 is a new software application for converting X-ray powder diffraction data
to the atomic pair distribution function (PDF). PDFgetX3 has been designed for ease of
use, speed and automated operation. The software can readily process hundreds of
X-ray patterns within a few seconds and is thus useful for high-throughput PDF studies
that measure numerous datasets as a function of time, temperature or other
environmental parameters. In comparison to the preceding programs, PDFgetX3 requires
fewer inputs and less user experience, and can be readily adopted by novice users. The
live-plotting interactive feature allows the user to assess the effects of calculation
parameters and select their optimum values. PDFgetX3 uses an ad-hoc data correction
method, where the slowly-changing, structure-independent signal is filtered out to obtain
coherent X-ray intensities that contain structure information. The outputs from PDFgetX3
have been verified by processing experimental PDFs from inorganic, organic and nanosized
samples and comparing them to their counterparts from previously established
software. Despite the different algorithm, the obtained PDFs were nearly identical and
yielded highly similar results when used in structure refinement. PDFgetX3 is written
in the Python language and features a well documented, reusable codebase. The software
can be used either as a standalone application or as a library of PDF-processing
functions that can be called from other Python scripts. The software is free for open
academic research, but requires a paid license for commercial use.

1. Introduction 

With the increased interest in producing and exploiting nanostructured materials, it is 
necessary to expand the methods that go beyond crystallography (Billinge, 2010) for 
characterizing their atomic scale structure. In recent years, total scattering and atomic 
pair distribution function (PDF) analysis (Egami & Billinge, 2012) has emerged as 
a popular and powerful tool for this purpose (Billinge & Kanatzidis, 2004; Billinge, 
2008; Young & Goodwin, 2011). To satisfy this demand, a number of X-ray and neutron
beamlines dedicated to, or optimized for, such measurements have emerged (Egami &
Billinge, 2012), and manufacturers of laboratory X-ray sources are also beginning
to market instruments for this kind of measurement. Especially with the use of 2D
detectors, modern beamlines are yielding total scattering data at unprecedented rates,
allowing detailed parametric and time-resolved total scattering studies to be carried
out in special environments (Chupas et al., 2004; Chupas et al., 2007; Jensen et al.,
2012; Redmond et al., 2012). A bottleneck in further growth of the method is now
the lack of robust and automatable software for creating PDFs from the raw data,
currently a computationally and user-intensive process (Egami & Billinge, 2012).

This can be illustrated by considering one of the most widely used software
programs for this purpose, PDFgetX2 (Qiu et al., 2004). The program offers users a great
deal of flexibility and control in choosing exactly which corrections to apply to X-ray
scattering intensities in order to convert them to PDFs. However, due to the myriad
options available to users, as well as the esoteric nature of many of the corrections
(Egami & Billinge, 2012), PDF generation requires considerable user input and
expertise in arcane details of the technique. Although the software has a graphical user
interface, carrying out the corrections is time-intensive, with many
possibilities for input errors, and the process cannot be easily automated for high
throughput of many data sets.

In this paper, we describe a new software program, PDFgetX3, which implements 
an ad-hoc data reduction algorithm (Billinge & Farrow, 2012) that requires little 
user input, generates PDFs in a fraction of a second, and can be straightforwardly 
automated to batch-process thousands of PDFs. Here we show that, in the physically
relevant region of the PDF, it produces quantitatively accurate PDFs that are the same
as those obtained using PDFgetX2 for the cases shown, and which yield refined
structural parameters that are also indistinguishable from those refined from
PDFgetX2-determined PDFs.

The intensities measured in a total scattering experiment, $I_m(Q)$, can be expressed
as (Billinge & Farrow, 2012)

$$I_m(Q) = a(Q) I_c(Q) + b(Q), \tag{1}$$

where $I_c(Q)$ is the coherent scattering intensity, which contains all of the structural
information about the sample, and $a(Q)$ and $b(Q)$ are multiplicative and additive
corrections to the measured intensity, which do not contain structural information
(Billinge & Farrow, 2012). Examples of the additive contributions are incoherent
Compton scattering and background scattering from the sample container.
Examples of the multiplicative contributions are sample self-absorption and polarization of
the X-ray beam. The approach used by PDFgetX2, and other PDF data analysis
programs, is to apply known corrections to $I_m(Q)$ to obtain the coherent scattering,
$I_c(Q)$, which is transformed into the structure function, $S(Q)$, according to

$$S(Q) = \frac{I_c(Q) - \langle f(Q)^2 \rangle + \langle f(Q) \rangle^2}{\langle f(Q) \rangle^2}. \tag{2}$$

Here $f(Q)$ is the atomic scattering factor and the angle brackets indicate an average
over all the atom types in the sample. For the neutron case the $f$'s are replaced by
coherent neutron scattering lengths, $b$, in this equation.

The $S(Q)$ is Fourier transformed into the PDF, $G(r)$, according to (Farrow &
Billinge, 2009)

$$G(r) = \frac{2}{\pi} \int_{Q_{min}}^{Q_{max}} F(Q) \sin(Qr) \, dQ, \tag{3}$$

where the quantity $F(Q) = Q[S(Q) - 1]$ is the reduced structure function (Warren,
1990). The many corrections required are discussed in detail in Chapter 5 of Egami
and Billinge (Egami & Billinge, 2012), including background subtraction, polarization,
self-absorption, multiple scattering and Compton scattering, among many others, and
these are implemented in PDFgetX2 (Qiu et al., 2004) and other similar programs
(Petkov, 1989; Petkov & Danev, 1998; Jeong et al., 2001; Soper & Barney, 2011).

It has recently been pointed out (Billinge & Farrow, 2012) that sufficient information
is known about the general behavior of the correction terms in Eq. 1, and about
the asymptotic behavior of the resulting $F(Q)$ function, that it may be possible to
determine $a(Q)$ and $b(Q)$ through an ad-hoc approach where they are parameterized
and the parameters varied using a regression method in such a way as to yield an
accurate $F(Q)$ function. Here we describe an algorithm for doing this, as well as a
software implementation, PDFgetX3, and we show that, indeed, it yields PDFs that
are not significantly different from those obtained using PDFgetX2. The program is
fast, easy to use, and highly automatable.
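
As an illustration of Eq. 2, the following minimal sketch computes $S(Q)$ and $F(Q)$ from an already corrected coherent intensity. It is not taken from PDFgetX3: the function name `structure_function` and its arguments are hypothetical, and it assumes that the intensity is normalized per atom and that the caller supplies the scattering factors $f_i(Q)$ for each atom type.

```python
import numpy as np

def structure_function(q, i_coh, fractions, factors):
    """Return S(Q) and F(Q) = Q[S(Q) - 1] following Eq. 2.

    fractions: atomic fractions c_i summing to one (assumed input).
    factors:   array of shape (n_types, len(q)) with f_i(Q) per atom type
               (assumed to be supplied by the caller).
    """
    c = np.asarray(fractions, dtype=float)[:, None]
    f = np.asarray(factors, dtype=float)
    f_avg_sq = (c * f).sum(axis=0) ** 2    # <f(Q)>^2
    f_sq_avg = (c * f ** 2).sum(axis=0)    # <f(Q)^2>
    s = (i_coh - f_sq_avg + f_avg_sq) / f_avg_sq
    return s, q * (s - 1.0)
```

A structureless sample, for which $I_c(Q) = \langle f(Q)^2 \rangle$, gives $S(Q) = 1$ and $F(Q) = 0$ at every $Q$, which is a convenient sanity check.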

The method was developed initially for analyzing rapid acquisition PDF data from
2D detectors, though we show below that it is not limited to this application. It is
assumed that the 2D data have been correctly azimuthally integrated, and multiple
frames summed or averaged, to obtain a one-dimensional intensity vs. $Q$ or intensity
vs. $2\theta$ diffraction pattern. A number of integration programs exist for this purpose,
for example Fit2D (Hammersley et al., 1996).
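
For data recorded against $2\theta$, the conversion to $Q$ uses the standard relation $Q = 4\pi \sin\theta / \lambda$. The sketch below shows that conversion together with linear interpolation onto an equidistant $Q$-grid. The function name `resample_to_qgrid`, the default step and the choice of linear interpolation are assumptions made for illustration; the source does not state which interpolation scheme PDFgetX3 uses.

```python
import numpy as np

def resample_to_qgrid(twotheta_deg, intensity, wavelength, qstep=0.01):
    """Convert 2theta (degrees, ascending) to Q and resample to a uniform grid."""
    theta = np.radians(np.asarray(twotheta_deg, dtype=float)) / 2.0
    # Q carries the inverse units of the wavelength (e.g. 1/Angstrom).
    q = 4.0 * np.pi * np.sin(theta) / wavelength
    # Keep the grid inside the measured range so nothing is extrapolated.
    npts = int((q[-1] - q[0]) / qstep) + 1
    qgrid = q[0] + qstep * np.arange(npts)
    return qgrid, np.interp(qgrid, q, intensity)
```

The same function can be applied to the container background so that both signals share one grid before subtraction.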

The algorithm (Billinge et al., 2011) starts with raw intensity data measured versus
scattering angle $2\theta$. First, the angle is converted to the scattering vector $Q$ and the
data are re-sampled onto an equidistant $Q$-grid, which is suitable for a fast Fourier
transformation at a later step and also ensures constant weights in a $Q$-dependent
fit. Note that resampling introduces error correlations between points, which can
be minimized if the data are azimuthally integrated from 2D directly onto a
constant-$Q$ grid (Yang & Billinge, 2012). The background intensities from an empty container
are then re-sampled to the same $Q$-grid and subtracted from the sample data. This
yields raw intensities from the specimen only, which, however, are not normalized per
incident intensity nor per number of scatterers. The structure function $S(Q)$ should
oscillate around and then approach unity as $Q$ tends to infinity, which in practice is
about $Q = 25$ Å$^{-1}$. This means that the difference

$$S(Q) - 1 = \frac{I(Q)}{\langle f \rangle^2} - \frac{\langle f^2 \rangle}{\langle f \rangle^2} \tag{4}$$

must oscillate around zero and the normalized intensity $I(Q)/\langle f \rangle^2$ must be close to the
normal scattering factor $\langle f^2 \rangle/\langle f \rangle^2$ for any $Q$. The raw sample intensities are therefore
rescaled by a least-squares procedure to approach the normal scattering factor curve.
A physically correct scattering function $S(Q)$ should also display proper asymptotic
behavior in the derived function $F(Q) = Q[S(Q) - 1]$, which should oscillate around
zero and approach it with increasing $Q$. The PDFgetX3 algorithm is based on an
assumption that the experimental function $S_m(Q)$ deviates from the correct value by
a slowly changing additive factor $\beta_S(Q)$ such that

$$S_m(Q) - 1 = S(Q) - 1 + \beta_S(Q). \tag{5}$$

The derived function $F_m(Q)$ is then

$$F_m(Q) = Q[S(Q) - 1 + \beta_S(Q)] = F(Q) + Q\beta_S(Q). \tag{6}$$

Because the correct function $F(Q)$ oscillates around zero, the error term $\beta_S(Q)$
produces a slowly changing, $Q$-increasing background in $F_m(Q)$. The PDFgetX3 algorithm
estimates the background by modeling the $\beta_S(Q)$ function as an $n$-th degree
polynomial $P_n(Q)$, which is then fitted as $QP_n(Q)$ to the $F_m(Q)$ function. The corrected
function $F_c(Q)$ is afterwards obtained by subtracting the polynomial fit

$$F_c(Q) = F_m(Q) - Q P_n(Q). \tag{7}$$

The function $F_c(Q)$ shows the correct asymptotic behavior with $F_c \to 0$ for large $Q$
values. Finally, the $F_c(Q)$ signal is converted to $G(r)$ using the fast Fourier
transformation as per Eq. 3.
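
The last two steps can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions and not the PDFgetX3 implementation: the names `correct_fq` and `sine_transform` are hypothetical, the polynomial is fitted by unweighted linear least squares, and the Fourier integral is evaluated with a direct trapezoid sum on an equidistant $Q$-grid instead of a fast Fourier transform.

```python
import numpy as np

def correct_fq(q, fm, degree):
    """Fit Q*P_n(Q) to F_m(Q) and return F_c(Q) = F_m(Q) - Q*P_n(Q) (Eq. 7)."""
    # Rescale Q to [0, 1] so the polynomial basis stays well conditioned.
    x = q / q.max()
    # Columns x^1 ... x^(n+1): Q*P_n(Q) has no constant term.
    basis = np.vander(x, degree + 2, increasing=True)[:, 1:]
    coeffs, *_ = np.linalg.lstsq(basis, fm, rcond=None)
    return fm - basis @ coeffs

def sine_transform(q, fq, r):
    """Evaluate G(r) = (2/pi) * integral of F(Q) sin(Qr) dQ on a uniform Q-grid."""
    weights = np.full(q.size, q[1] - q[0])
    weights[0] *= 0.5       # trapezoid end-point weights
    weights[-1] *= 0.5
    return (2.0 / np.pi) * (np.sin(np.outer(r, q)) * fq) @ weights
```

A background that is exactly of the form $QP_n(Q)$ is removed completely by `correct_fq`, while a signal $\sin(Qr_0)$ passed through `sine_transform` produces a peak at $r = r_0$.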

Since the fitted polynomial is an approximation to the actual error term $\beta_S(Q)$, the
corrected function $F_c(Q)$ still deviates from the ideal $F$ by

$$\Delta F(Q) = F_c(Q) - F(Q) = Q\beta_S(Q) - Q P_n(Q), \tag{8}$$

and the difference introduces an error signal $\Delta G(r)$ in the obtained PDF. The
function $QP_n(Q)$ is an $(n+1)$st-degree polynomial approximation to the $Q\beta_S(Q)$
function on a fit interval running from zero to $Q_{maxinst}$; therefore we can assume that the
$\Delta F(Q)$ difference has $(n+1)$ roots that are essentially equidistant between $Q = 0$ Å$^{-1}$
and $Q_{maxinst}$. The difference function $\Delta F(Q)$ switches between positive and negative
values at each root, which roughly corresponds to oscillations with a half-period of
$Q_{maxinst}/n$. Assuming this to be the maximum $Q$ "frequency" in the difference
signal $\Delta F(Q)$, the Fourier transformation would introduce a non-physical signal $\Delta G(r)$
extending up to

$$r_{poly} = \pi n / Q_{maxinst}. \tag{9}$$

For typical RAPDF experimental data, the PDFgetX3 program uses an 8th-degree
polynomial correction with $Q_{maxinst} = 28$ Å$^{-1}$, which yields $r_{poly} = 0.9$ Å. Assuming
there are no higher frequency aberrations in the data themselves, the error signal
$\Delta G(r)$ arising from the polynomial data correction is thus present only for lengths
smaller than $r \approx 0.9$ Å, i.e., in a region below the shortest bond lengths in most
materials. Furthermore, the polynomial fit cannot accidentally remove real structural
signals from the experimental intensity provided the value of $r_{poly}$ is chosen to be
below the nearest neighbor bond distance in the material.
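
The arithmetic behind Eq. 9 is easy to check. The helper name `rpoly_from_degree` below is hypothetical and serves only to reproduce the value quoted in the text.

```python
import math

def rpoly_from_degree(n, qmaxinst):
    """Upper r-limit of the polynomial-correction artifact, Eq. 9."""
    return math.pi * n / qmaxinst

# An 8th-degree polynomial with Qmaxinst = 28 1/Angstrom:
r_poly = rpoly_from_degree(8, 28.0)   # about 0.898 Angstrom, i.e. 0.9 to one decimal
```

The same degree applied to half the $Q$-range doubles $r_{poly}$, which is why a limited $Q$-range calls for a lower polynomial degree.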

Under some experimental conditions, such as with lower energy X-ray sources or
where the experimental take-off angle is very limited, the instrument $Q$-range $Q_{maxinst}$
is much smaller and may increase the error extent $r_{poly}$ to physically meaningful
distances. In such cases, the degree of the correction polynomial $n$ needs to be reduced
to avoid overcorrecting the measured data and to keep the value of $r_{poly}$ small. The
PDFgetX3 procedure uses Eq. 9 in reverse, and for a fixed value of the error extent
$r_{poly}$ and instrument range $Q_{maxinst}$, it obtains the degree of the correction polynomial as

$$n_r = r_{poly} Q_{maxinst} / \pi. \tag{10}$$

This estimate of the polynomial degree $n_r$ is almost never an integer, and rounding it
to an integer would introduce abrupt changes in the PDF at the half-integer values.
We would prefer the PDF to respond smoothly to the $r_{poly}$ and $Q_{maxinst}$ parameters.
To simulate a polynomial fit at an arbitrary floating-point degree, the correction
polynomial is therefore refined twice, for the integer floor and ceiling of $n_r$, and the two
fits are then averaged with the weights given by the distance of $n_r$ from its integer
bounds.
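
One way to read this weighting is as a linear interpolation between the two integer-degree fits, so that the fit whose degree is closer to $n_r$ receives the larger weight. The sketch below makes that assumption explicit; the function names are hypothetical, the fit is unweighted, and the input is assumed to be already restricted to the interval from zero to $Q_{maxinst}$.

```python
import math
import numpy as np

def fit_qpoly(q, fm, degree):
    """Least-squares fit of Q*P_n(Q) (no constant term) to F_m(Q)."""
    x = q / q.max()
    basis = np.vander(x, degree + 2, increasing=True)[:, 1:]
    coeffs, *_ = np.linalg.lstsq(basis, fm, rcond=None)
    return basis @ coeffs

def fractional_degree_fit(q, fm, rpoly, qmaxinst):
    """Blend the floor- and ceiling-degree fits for a non-integer n_r."""
    n_r = rpoly * qmaxinst / math.pi             # Eq. 10
    n_lo, n_hi = math.floor(n_r), math.ceil(n_r)
    if n_lo == n_hi:
        return fit_qpoly(q, fm, n_lo)
    w_hi = n_r - n_lo                            # assumed linear weighting
    return (1.0 - w_hi) * fit_qpoly(q, fm, n_lo) + w_hi * fit_qpoly(q, fm, n_hi)
```

With $r_{poly} = 0.9$ Å and $Q_{maxinst} = 28$ Å$^{-1}$, Eq. 10 gives $n_r \approx 8.02$, so the result is dominated by the 8th-degree fit with a small admixture of the 9th-degree one.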

2. Program availability and operation 

PDFgetX3 is written in the Python programming language (Python, 1990). To run, it
requires Python 2.5 or later with the NumPy and Matplotlib libraries installed (note
that Python 3.0 is not currently supported). It has been tested to work on the
Windows, Linux, and Mac operating systems. Information on the installation and
operation of the software can be found at the www.diffpy.org website. The command-line
version is free for university-based researchers conducting open academic research, but
other uses require a paid license. A version with a graphical user interface (GUI) and an
online version are also under development. Information can be found at http://diffpy.org.

Because the corrections are ad-hoc, only minimal information needs to be supplied
by the user, and this is contained in a configuration file or can be specified as
command-line arguments. In the current implementation the program reads data that
are stored in a multi-column text file with the independent variable, $Q$ or $2\theta$, in the
first column and the measured intensity in the second. If the uncertainties on points
in the data are known, these may be placed in subsequent columns. The filename of
the input file, and of a measured background file if one wants to subtract it, must be
specified, and if the independent variable is $2\theta$ then an X-ray wavelength must also
be specified. The approximate composition of the sample is also specified so that the
$f(Q)$ averages may be computed accurately. A background scale parameter, and the $Q_{max}$
to be used in the Fourier transform, are specified, though these have default values
and the program works when they are not provided. The optimal values of some of
these parameters may not be known a priori, and the program may be run in an
interactive mode where various tuning parameters may be varied by moving a slider,
with the resulting PDFs updating in real-time in a plot window. In this way a user
may quickly find the optimal $Q_{max}$ and background scale values that are fed back
to the program. It takes only a few microseconds to complete the corrections on the
raw data and so the plots update in real-time as the user adjusts the slider. Some
other parameters may also be controlled by the user to obtain the desired output, as
described in the manual.
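
The input layout described above (independent variable in the first column, intensity in the second, optional uncertainties after that) can be read with a generic NumPy call. The following sketch is not PDFgetX3's own loader: `load_pattern` is a hypothetical helper, and the use of `#` as a comment character is an assumption.

```python
import io
import numpy as np

def load_pattern(source):
    """Read x (Q or 2theta), intensity and an optional uncertainty column."""
    data = np.loadtxt(source, comments="#", ndmin=2)
    x, intensity = data[:, 0], data[:, 1]
    # A third column, if present, is taken as the intensity uncertainty.
    sigma = data[:, 2] if data.shape[1] > 2 else None
    return x, intensity, sigma

# Example with an in-memory file; a filename works the same way.
demo = io.StringIO("# Q  I  sigma\n0.1 10.0 0.5\n0.2 12.0 0.6\n")
x, intensity, sigma = load_pattern(demo)
```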

The program has a powerful Python-based command-line interpreter capability, for
example, allowing templates to be used for multiple files that have the same filename
root but which iterate in some way in the name, for example, by run number. This
makes the automation of data reduction of many hundreds or thousands of datasets
rather straightforward. The program is also written with a well documented API
so that programmers can access the functionality of the engine within home-written
Python scripts of arbitrary complexity. A screenshot of the program working in
interactive mode is shown in Figure 1.

Fig. 1. Screenshot of PDFgetX3 in interactive mode. The user selected to plot $F(Q)$
and $G(r)$. These plots are updated in real-time as the user moves the sliders with
the mouse. There are four sliders in this example, for $Q_{min}$, $Q_{max}$, $Q_{maxinst}$ and $r_{poly}$.
The first two are self-explanatory. $Q_{maxinst}$ varies the range over which the correction
polynomial is fit and $r_{poly}$ places an upper bound on the frequency of information
that the ad-hoc procedure can remove by fitting (see Billinge & Farrow, 2012, for
details). If the user wishes to subtract a background signal, the background scale
will also appear as a slider option.



3. Comparison of PDFgetX3 and PDFgetX2 PDFs 

PDFs have been determined with PDFgetX3 for a number of representative
samples and compared with those determined from the same data using PDFgetX2. The
resulting PDFs are compared by plotting them on top of each other. Where possible,
structural models have been refined to both PDFs, allowing a direct comparison of fit
quality and the values of refined structural parameters from each PDF. The examples
include inorganic materials such as bulk nickel and barium titanate, nanostructured
γ-alumina, and bulk and nanocrystalline cadmium selenide, as well as crystalline and
nanostructured phases of the organic pharmaceutical carbamazepine. We chose these
very different types of materials to show that PDFgetX3 is a robust program that can
handle a wide variety of high-energy X-ray data.

In all cases, PDFs from both programs are made from the same raw data and,
where appropriate, use the same input parameters (i.e., $Q_{max}$, X-ray wavelength,
chemical composition, and container background). All data sets except the γ-Al$_2$O$_3$
were collected at high-energy synchrotron instruments using the rapid acquisition
PDF mode (RAPDF) (Chupas et al., 2003), where data are collected on a 2D detector
and azimuthally integrated to obtain 1D datasets. However, a synchrotron is not a
requirement: PDFgetX3 can handle data from lab-based XRPD instruments and
synchrotron data collected in point-by-point mode, for example from high resolution
diffractometers such as ID31 at ESRF. To show this, the comparison is done for γ-Al$_2$O$_3$ data
that were collected with a Panalytical laboratory-based silver anode diffractometer.

In general, as we will see in the following examples, we find that the PDFs made by
the different programs look somewhat different from one another at $r$ values lower than
$r_{poly}$. However, in the physically meaningful range beyond the first nearest neighbor
peak the PDFs look almost exactly the same. In the plots the PDFs obtained by the
different methods have been rescaled by a constant such that the nearest neighbor peak
is the same height between PDFs on the same plot. The ad-hoc approach (Billinge &
Farrow, 2012) does not result in absolutely normalized data and normalization must
be carried out by other methods. A constant scale offset has been shown not to affect
the structural information in the PDF (Peterson et al., 2003) when it is modeled with
a scale factor variable, since the relative scaling of peaks to one another within the
same PDF is preserved.
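
The comparison procedure just described (rescale to a common nearest neighbor peak height, then examine the difference curve) can be sketched as follows. The function `compare_pdfs` and its arguments are hypothetical; it assumes both PDFs are sampled on the same $r$-grid and that the caller supplies a window containing the nearest neighbor peak and the lower $r$ limit for the statistics.

```python
import numpy as np

def compare_pdfs(r, g_a, g_b, peak_window, rmin):
    """Scale g_b to the nearest-neighbor peak height of g_a and difference them."""
    lo, hi = peak_window
    in_peak = (r >= lo) & (r <= hi)
    # One constant factor equalizes the peak maxima inside the window.
    scale = g_a[in_peak].max() / g_b[in_peak].max()
    diff = g_a - scale * g_b
    # Standard deviation of the difference, ignoring the unphysical low-r region.
    sigma = diff[r >= rmin].std()
    return scale, diff, sigma
```

Dashed guide lines at $\pm 2\sigma$ can then be drawn around the offset difference curve.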

Models were fit to the PDFs by refining a variety of parameters as appropriate, such
as lattice parameters, atomic positions and thermal factors, using the program PDFgui
(Farrow et al., 2007). In each case, we compare the $R_w$ value as well as the values of
the refined parameters from the PDF obtained using PDFgetX2 and PDFgetX3.
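
The text does not spell out how $R_w$ is computed. The sketch below assumes the weighted residual commonly used in PDF refinement, $R_w = \sqrt{\sum_i w_i (G_{obs,i} - G_{calc,i})^2 / \sum_i w_i G_{obs,i}^2}$, with unit weights when no uncertainties are available; the function name is hypothetical.

```python
import numpy as np

def weighted_residual(g_obs, g_calc, weights=None):
    """R_w between an observed and a calculated PDF (assumed definition)."""
    g_obs = np.asarray(g_obs, dtype=float)
    g_calc = np.asarray(g_calc, dtype=float)
    w = np.ones_like(g_obs) if weights is None else np.asarray(weights, dtype=float)
    return np.sqrt(np.sum(w * (g_obs - g_calc) ** 2) / np.sum(w * g_obs ** 2))
```

A perfect fit gives $R_w = 0$, and a model that predicts zero everywhere gives $R_w = 1$.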
3.1. Nickel and Barium Titanate

First we look at pure nickel (Ni) and barium titanate (BaTiO$_3$) in Figure 2.

Fig. 2. PDFs of (a) nickel and (b) barium titanate made with PDFgetX2 (blue) and
PDFgetX3 (green) with $Q_{max} = 26.0$ Å$^{-1}$ in both cases. Difference curve (offset) is
in red. The dashed lines represent two standard deviations in the difference curve ($r$
values below the nearest neighbor peaks were not included in the standard deviation
calculation).



Both compounds diffract strongly, making data corrections less challenging.
Figure 2(a) shows the two PDFs of nickel plotted on top of one another. In all the figures
the PDF made with PDFgetX2 is in blue and the PDF made with PDFgetX3 is in
green. Here $Q_{max} = 26.0$ Å$^{-1}$ in both cases. The difference curve between the
two PDFs is plotted offset below in red, with dashed lines plotted at $\pm 2\sigma$ as guides to
the eye, where $\sigma$ is the standard deviation of the difference computed over the range
above $r_{poly}$. We see only very small differences between the two PDFs after the Ni-Ni
nearest neighbor peak (at $r = 2.2$ Å). We see the same behavior in Figure 2(b) with
barium titanate.

The refined parameters from model fits are reproduced in Table 1. In the case of Ni
only a few parameters may be varied because of the simplicity of the structure. Overall,
we see very good agreement between most of the parameters and the $R_w$ values. There
are more structural parameters that may be varied in the BaTiO$_3$ case (Megaw, 1962),
as reproduced in Table 2. The parameters still agree very well with one another and the
quality of the fit as measured by $R_w$ is the same. We do not report estimated standard
deviations on the refined parameters since we do not have reliable error estimates for
the data themselves. The enhancement of PDFgetX3 to propagate uncertainties on
the data and the problem of extracting reliable uncertainties on integrated powder
data from 2D integrating detectors are being addressed (Yang & Billinge, 2012), so
we expect this problem to be resolved in the near future.

Table 1. Comparison of the parameters refined in fitting the Ni model to the PDFs.

Parameter                          PDFgetX2    PDFgetX3
$Q_{damp}$ (Å$^{-1}$)              0.0554      0.0570
$a = b = c$ (Å)                    3.5239      3.5237
$\delta_2$ (Å$^2$)                 2.52        2.71
$U_{iso}$ (Å$^2$)                  0.00612     0.00564
$R_w$                              0.0796      0.0821



Table 2. Comparison of the parameters refined in fitting the BaTiO$_3$ model to the PDFs.

Parameter                              PDFgetX2    PDFgetX3
$Q_{damp}$ (Å$^{-1}$)                  0.0485      0.0491
$a = b$ (Å)                            3.9952      3.9952
$c$ (Å)                                4.0399      4.0398
$\delta_2$ (Å$^2$)                     4.32        4.37
$U_{11,Ba} = U_{22,Ba}$ (Å$^2$)        0.00516     0.00494
$U_{33,Ba}$ (Å$^2$)                    0.00454     0.00432
$U_{11,Ti} = U_{22,Ti}$ (Å$^2$)        0.00874     0.00839
$U_{33,Ti}$ (Å$^2$)                    0.0125      0.0121
$U_{11,O} = U_{22,O}$ (Å$^2$)          0.0113      0.0103
$U_{33,O}$ (Å$^2$)                     0.0927      0.0953
$R_w$                                  0.118       0.121
3.2. Nanocrystalline γ-Alumina

Next, we investigate γ-alumina (Al$_2$O$_3$) using X-rays from a silver anode
diffractometer ($\lambda = 0.56$ Å). The γ phase of Al$_2$O$_3$ has a local nanocrystalline structure that
is different from the structure over longer range (Paglia et al., 2006). For this reason,
a new structure model was developed for the local structure of γ-Al$_2$O$_3$ up to $r = 8$ Å
(ICSD 173014) (Paglia et al., 2006). Figure 3 shows the PDFs of γ-Al$_2$O$_3$ made with
PDFgetX2 and PDFgetX3 over this region. We see very good agreement between the
PDFs. In fact, the PDFgetX3 PDF looks better at low $r$ values.



Fig. 3. PDFs of γ-Al$_2$O$_3$ made with PDFgetX2 (blue) and PDFgetX3 (green) with
$Q_{max} = 20.5$ Å$^{-1}$ in both cases. Difference curve (offset) is in red. The dashed
lines represent two standard deviations in the difference curve ($r$ values below the
nearest neighbor peaks were not included in the standard deviation calculation).

Refined parameters are in Table 3. Unlike in previous cases, where we tried to use a
large $r$ range for our refinement, in this case we refined only over the range $r = 1.5$ to 8 Å
because the model only applies over this range. For this reason, we wanted to refine
few parameters (this is why $U_{iso}$ was used rather than anisotropic thermal factors).
Nevertheless, we see very good agreement between the fit results.

Table 3. Comparison of the parameters refined in fitting the γ-Al$_2$O$_3$ model to the PDFs.

Parameter                          PDFgetX2    PDFgetX3
$Q_{damp}$ (Å$^{-1}$)              0.0770      0.0808
$a$ (Å)                            3.3943      3.3941
$b$ (Å)                            2.7796      2.7802
$c$ (Å)                            7.0419      7.0395
$\delta_2$ (Å$^2$)                 1.13        0.991
$U_{iso,O}$ (Å$^2$)                0.0126      0.0123
$U_{iso,Al}$ (Å$^2$)               0.0148      0.0145
$R_w$                              0.164       0.166



3.3. Cadmium Selenide Nanoparticles 

We now turn our attention to a more challenging class of materials:
nanoparticles, which tend to be weakly scattering and more disordered. In Figure 4 we show
PDFs of three samples of cadmium selenide (CdSe) taken from data published in
(Masadeh et al., 2007). The bulk CdSe in panel (a) is included for completeness. The
nanoparticles in panels (b) and (c) were calculated to have diameters of 37 Å and
22 Å, respectively (Masadeh et al., 2007). We see that in all three panels of Figure 4,
the PDFs made with the two programs are almost identical. It was challenging to
obtain the PDFs from PDFgetX2, requiring considerable care, user intervention
and parameter tuning. In the case of PDFgetX3 the PDFs shown were produced with
no more effort than the Ni and BaTiO$_3$ PDFs shown above. The low-$r$ region looks a
little different between the PDFs, especially as the size of the nanoparticles gets
smaller, but recall that this region contains no physical information. In fact,
we might even argue that in panel (c), the PDFgetX3 PDF looks cleaner than the
PDFgetX2 PDF.

Fig. 4. PDFs of (a) bulk CdSe, (b) 37 Å, and (c) 22 Å CdSe nanoparticles
made with PDFgetX2 (blue) and PDFgetX3 (green) with $Q_{max} = 18.0$ Å$^{-1}$ in all
cases. Difference curve (offset) is in red. The dashed lines represent two standard
deviations in the difference curve ($r$ values below the nearest neighbor peaks were
not included in the standard deviation calculation and, for the nanoparticle in panel
(c), $r$ values larger than 22 Å were not included).



Table 4 contains the refined parameters for the CdSe samples compared to a model
based on wurtzite (Wyckoff, 1967). Again we see very good agreement between all
parameters determined from the PDFgetX2 and PDFgetX3 PDFs, and the residual, $R_w$, is
highly comparable within each pair of PDFs.

Table 4. Comparison of the parameters refined in fitting the CdSe wurtzite model to the
PDFs.

Parameter                              PDFgetX2    PDFgetX3

Bulk
$Q_{damp}$ (Å$^{-1}$)                  0.0593      0.0599
$a = b$ (Å)                            4.2996      4.2996
$c$ (Å)                                7.0112      7.0113
$\delta_2$ (Å$^2$)                     3.21        3.26
$U_{11,Cd} = U_{22,Cd}$ (Å$^2$)        0.0156      0.0155
$U_{33,Cd}$ (Å$^2$)                    0.0143      0.0141
$U_{11,Se} = U_{22,Se}$ (Å$^2$)        0.0129      0.0128
$U_{33,Se}$ (Å$^2$)                    0.0581      0.0575
$R_w$                                  0.114       0.104

37 Å nanoparticle
$a = b$ (Å)                            4.2956      4.2961
$c$ (Å)                                7.0068      7.0075
$\delta_2$ (Å$^2$)                     4.66        4.74
$U_{11,Cd} = U_{22,Cd}$ (Å$^2$)        0.0225      0.0221
$U_{33,Cd}$ (Å$^2$)                    0.0302      0.0302
$U_{11,Se} = U_{22,Se}$ (Å$^2$)        0.0120      0.0118
$U_{33,Se}$ (Å$^2$)                    0.199       0.194
Particle diameter (Å)                  36.39       35.34
$R_w$                                  0.194       0.173

22 Å nanoparticle
$a = b$ (Å)                            4.2940      4.2948
$c$ (Å)                                6.8567      6.8633
$\delta_2$ (Å$^2$)                     4.97        5.20
$U_{11,Cd} = U_{22,Cd}$ (Å$^2$)        0.0433      0.0415
$U_{33,Cd}$ (Å$^2$)                    0.0403      0.0409
$U_{11,Se} = U_{22,Se}$ (Å$^2$)        0.0199      0.0203
$U_{33,Se}$ (Å$^2$)                    0.233       0.221
Particle diameter (Å)                  23.13       23.35
$R_w$                                  0.262       0.265



3.4. Pharmaceuticals

The final class of materials that we tested is organic pharmaceutical compounds.
These materials can be crystalline, as we see in Figure 5(a) and (b), nanostructured
as in Figure 5(c), or amorphous. They tend to have relatively complicated
crystal structures that are made up of mostly light, organic elements such as hydrogen,
carbon, and oxygen that do not diffract strongly, and even crystalline-phase
pharmaceutical compounds require quite a bit of tinkering in PDFgetX2 to produce a good PDF.

In the examples here we consider three polymorphs of the drug carbamazepine
(CBZ): crystalline CBZ form-I and form-III, as well as melt-quenched carbamazepine
that turned out to be nanocrystalline (Billinge et al., 2010; Dykhne et al., 2011).
As with the nanoparticles in Figure 4, the PDFs in Figure 5 made with PDFgetX2
have relatively large fluctuations from imperfect corrections at low $r$. This is common
for weakly scattering samples. However, these were the best PDFs that could be
obtained using PDFgetX2 at the time of publication. The PDFs made with PDFgetX3
are highly similar in the physically meaningful region above the nearest neighbor
separation (C-C bond at 1.4 Å), with the added benefit that they appear to be cleaner
in the unphysical low-$r$ region. This is an advantage because termination ripples from
large features in the unphysical region may propagate into the physically meaningful
region of the PDF.

We did not fit these PDFs to models because new modeling tools need to be devel- 
oped for this class of materials. 

Fig. 5. PDFs of (a) CBZ-I, (b) CBZ-III, and (c) nanostructured CBZ made with
PDFgetX2 (blue) and PDFgetX3 (green) with $Q_{max} = 20.0$ Å$^{-1}$ in all cases.
Difference curve (offset) is in red. The dashed lines represent two standard deviations
in the difference curve ($r$ values below the nearest neighbor peaks were not included
in the standard deviation calculation).

4. Summary

We have described and demonstrated an implementation of the ad-hoc data reduction
protocol described in (Billinge & Farrow, 2012) in a new Python-based software
program, PDFgetX3. PDFs obtained using this method have been compared with PDFs
obtained using PDFgetX2, an established program for producing PDFs, and are found
to be highly similar. Models fit to the PDFgetX2 and PDFgetX3 PDFs yield refined
parameters that are correspondingly similar. The program has been tested on a range
of samples, from strongly scattering inorganic crystalline powders such as nickel and
BaTiO$_3$ to weakly scattering, low atomic number pharmaceutical compounds. The
program is easy to use compared to PDFgetX2 and rapid, giving PDFs in real-time as
parameters such as background scale or $Q_{max}$ are varied. The program should be suitable
for most PDF studies (though it does not yield data on an absolute scale), and will prove
to be especially useful for high-throughput studies such as parametric or time-resolved
experiments. More information about the program is available at www.diffpy.org.